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ALLOSTERICALLY REGULATED RIBOZYMES 

FIELD OF THE INVENTION 

The present invention relates generally to the field of 
ribozymes and in particular to aptazymes or allosteric, 
regulatable ribozymes that modulate their kinetic parameters 
in response to the presence of an effector molecule. 



BACKGROUND OF THE INVENTION 

The United States Government may own certain rights in 
this invention under DARPA Grant No.: N65236-98-1-5413 and 
MURI Grant No.: DAAD19-99-1-0207 . 

This patent claims priority from Provisional Patent 
Application Serial No. 60/212,097, entitled "Aptazymes for 
Genetic Regulatory Circuits", filed June 15, 2000. 

Ribozymes or RNA enzymes are oligonucleotides of RNA that 
can act like enzymes by catalyzing the cleavage of RNA 
molecules. Generally, ribozymes have the ability to behave 
like an endoribonuleases . The location of the cleavage site 
is highly sequence specific, approaching the sequence 
specificity of DNA restriction endonucleases . By varying 
conditions, ribozymes can also act as polymerases or 

1 JI dephosphorylases . 

Ribozymes were first described in connection with 
Tetrahymena thermophilia . The Tetrahymena rRNA was shown to 
contain an intervening sequence (IVS) capable of excising 
itself out of a large ribosomal RNA precursor. The IVS is a 
catalytic RNA molecule that mediates self-splicing out of a 
precursor, whereupon it converts itself into a circular form. 
The Tetrahymena IVS is more commonly known now as the Group I 
Intron . 
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Regulatable ribozymes have been described, wherein the 
activity of the ribozyme is regulated by a ligand binding 
moiety. Upon binding the ligand, the ribozyme activity on a 
target RNA is changed. The ligand-binding portion is an RNA 
sequence capable of binding a ligand such as an organic or 
inorganic molecule, or even a prodrug. The regulatable 
ribozymes described to date target bind, e.g., a first target 
sequence and the enzymatic activity is brought to bear on a 
separate RNA molecule for cleavage. 
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SUMMARY OF THE INVENTION 

The present invention provides an allosterically 
regulatable ribozyme or aptazyme that has the advantage of 
specific gene recognition with modulation of the enzymatic 
activity of the gene product typically exploited by 
pharmaceuticals. The aptazymes of the present invention are, 
therefore, allosteric ribozymes in that their activity is 
under the allosteric control of a second portion of the 
ribozyme. Just as allosteric protein enzymes undergo a change 
in their kinetic parameters or of their enzymatic activity in 
response to interactions with an effector molecule, the 
catalytic abilities of the regulatable aptazymes may similarly 
be modulated by an allosteric effector (s). Thus, the present 
invention is directed to allosterically regulatable aptazymes 
that transduce molecular recognition into catalysis. 

The present invention includes an allosterically 
regulatable aptazyme construct that is inserted into a gene of 
interest, e.g., a gene targeting expression vector. The 
regulatable aptazyme sequence provides gene specific 
recognition as well as modulation of the aptazyme' s kinetic 
parameters. The kinetic parameters of the regulatable 
aptazyme vary in response to an allosteric effector molecule. 
Specifically, in the presence of the allosteric effector, the 
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aptazyme splices itself out of the gene in response to the 
effector molecule to regulate expression of the gene. An 
important feature of the present invention is that the 
regulatable aptazymes disclosed herein only require 
recognition rather than actual binding. 

A key distinction with known systems is that the 
regulatory domains of the regulatable aptazymes of the present 
invention may bind targets, but they are engineered and 
selected without the necessity of knowing anything about their 
binding target. In the present invention it is the allosteric 
1 interaction of an effector molecule with the regulatory domain 

| that transduces interactions into catalysis. Therefore, 

1 binding is a concomitant but secondary function of such 

interactions; that is, the allosteric ribozymes disclosed 
herein may bind the effector or the target very poorly, but 
upon their interaction, a synergistic effect is found that 
could not be detected by screening for each characteristic 
alone . 

in the present invention the effector molecule does not 
produce a conformational change, but rather will add essential 
catalytic sites (e.g., residues) for a reaction. That is, 
both the effector molecule and the regulatable aptazyme 
contribute a portion of the active site of the ribozyme. For 
example, using the method of the present invention a ribozyme 
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and an effector molecule that would be unable to bind and/or 
perform an enzymatic function independently, may be isolated 
that act synergistically. 

The present invention includes an aptazyme construct with 
a regulatable aptamer oligonucleotide sequence having a 
regulatory domain, such that the kinetic parameters of the 
aptazyme on a target gene vary in response to the interaction 
of an allosteric effector molecule with the regulatory domain. 

The construct of one embodiment of the present invention 
provides a DNA oligonucleotide coding for an aptazyme domain 
so that the DNA can be transcribed to RNA (e.g., mRNA) , where 
the RNA contains a self-splicing aptazyme domain that can be 
activated in the presence of an effector molecule. 

Aptazymes are more robust than allosteric protein enzymes 
in several ways: (D they can be selected in vitro, which 
facilitates the engineering of particular constructs; (2) the 
levels of catalytic modulation are much greater for aptazymes 
than for protein enzymes; and (3) since aptazymes are nucleic 
acids, they can potentially interact with the genetic 
machinery in ways that protein molecules may not. 

At least part of the utility of the present invention is 
for use in the identification, isolation and enhancement of 
allosteric effectors and of the allosterically regulatable 
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aptazymes with which they interact. Similarly, it is possible 
to activate or repress a reporter gene (e.g., luciferase) 
containing an engineered intron in response to an endogenous 
activator. In this way, lucif erase-engineered intron 
constructs may be used to monitor intracellular levels of 
proteins or small molecules such as cyclic AMP. Such 
applications may be used for high-throughput screening. 

The present invention includes a regulatable aptazyme 
construct having an aptamer oligonucleotide sequence with a 
regulatory domain. A characteristic of the regulatable 
aptazyme construct of the present invention is that the 
kinetic parameters of the aptazyme vary in response to an 
effector molecule. In particular, the kinetic parameters of 
the aptazyme on a target gene vary in response to the 
interaction of an allosteric effector molecule with the 
regulatory domain. For example, the aptazyme splices itself 
out of a gene in response to the effector molecule interacting 
with the regulatory domain of the aptazyme to regulate 
expression of the target gene. 

The present invention also contemplates that the 
regulatable aptazyme construct may be amplified by polymerase 
chain reaction. Finally, the invention contemplates that the 
regulatable aptamer oligonucleotide sequence of the construct 
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may include RNA nucleotides, so that the invention further 
includes reverse transcription of the RNA using reverse 
transcriptase to produce an DNA aptamer complementary to the 
RNA template. 

Aptazymes, or allosterically activated ribozymes, have 
been developed that are activated by cyclic nucleotide 
monophosphates as well as other small molecules like 
theophylline. In addition to aptazymes activated by small 
molecules, there are natural ribozymes that are extremely 
dependent on proteins for their activity. 

Other methods for aptazyme development using small 
molecule ligands have proven successful. In particular, it 
has been possible to add aptamer moieties to ribozymes, 
without selection, and achieve activation in the presence of 
ligand (like ATP or theophylline) . One of the unique features 
of the present selection protocol relative to others that have 
previously been published is that the present invention 
randomizes not only a domain that is pendant on the catalytic 
core, but a portion of the catalytic core itself. 

It should be noted that the method is not limited to RNA 
pools, but could also encompass DNA pools or modified RNA 
pools. The method is not limited to ligases, but could also 
encompass other ribozyme classes. 
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Nucleic acids are generally less robust than enzymes. 
However, modified nucleotides may be introduced into the 
aptazymes that substantially stabilize them from degradation 
in environments such as sera or urine. Similarly, enzymes 
generally have higher affinities for analytes than do 
aptamers, and be inference aptazymes. However, the analytical 
methods of the present invention do not rely on binding per 
se, but only on transient interactions. The present invention 
requires mere recognition rather that actual binding, 
providing a potential advantage of apatzymes over enzymes. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

For a more complete understanding of the features and 
advantages of the present invention, reference is now made to 
the detailed description of the invention along with the 
accompanying figures in which corresponding numerals in 
different figures refer to corresponding parts and in which: 
Figure 1 is a depiction of the secondary structure of the 
Group 1 theophylline-dependent (td) intron of bacteriophage T4 
(wild type) ; 

Figure 2a is a photograph of a gel showing activation of 
the G P IThlP6.131 aptamer construct, together with a graphical 
representation of the gel, of one embodiment of the present 
invention; 

Figure 2b is a photograph of a gel showing activation of 
G P ITh2P6.133 aptamer construct, together with a graphical 
representation of the gel of one embodiment of the present 
invention . 

Figure 3a depicts a portion of the P6 region of the 
Group I ribozyme joined to the anti- theophylline aptamer by a 
short randomized region to generate a pool of aptazymes of the 

present invention. 

Figure 3b is a schematic depiction of a selection 
protocol for the Group I P6 Aptazyme Pool of Fig. 3a. 
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Figure 4 is a diagram of one embodiment of the present 
invention depicting exogenous or endogenous activation of 
Group I intron splicing; 

Figure 5 is a diagram of another embodiment of the 
present invention depicting a strategy for screening libraries 
of exogenous activators; and 

Figure 6 is a diagram of an alternative embodiment of the 
present invention for screening libraries of exogenous 
activators . 



11 



DETAILED DESCRIPTION OP THE INVENTION 

While the making and using of various embodiments of the 
present invention are discussed in detail below, it should be 
appreciated that the present invention provides many 

5 applicable inventive concepts that may be embodied in a wide 

variety of specific contexts. The specific embodiments 
discussed herein are merely illustrative of specific ways to 
make and use the invention and do not delimit the scope of the 
invention . 

1 (J Def ini tions 

en 

m As US ed herein, the term "regulatable aptazyme" means an 

1 allosteric ribozyme. The kinetic parameters of the ribozyme 

S may be varied in response to the amount of an allosteric 

O effector molecule. Just as allosteric protein enzymes undergo 

yd 

lS a change in their kinetic parameters or of their enzymatic 

sasa 

3 activity in response to interactions with an effector 

u 

molecule, the catalytic abilities of regulatable aptazymes can 
similarly be modulated by effectors. Regulatable aptazymes 
transduce molecular recognition into catalysis upon 
interaction with an allosteric effector molecule that binds an 
effector portion of the regulatable aptazyme. Specifically, 
in the presence of the effector, the aptazyme splices itself 
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out of a gene in response to the effector molecule to regulate 

expression of the gene. 

As used herein, the term "aptamer" refers to an 
oligonucleotide having aptazyme activity. 

As used herein, the term "allosteric effector" or 
"allosteric effector molecule" means a substance that 
allosterically changes the kinetic parameters or catalytic 
activity of an aptazyme, and in particular a substance that 
activates self-splicing of an aptazyme. 

As used herein, the term "kinetic parameters" refers to 
catalytic activity. Changes in the kinetic parameters of a 
catalytic ribozyme produce changes in the catalytic activity 
of the ribozyme such as a change in the rate of reaction or a 
O change in substrate specificity. For example, self-splicing 

l| of an aptazyme out of a gene environment may result from a 

change in the kinetic parameters of the aptazyme. 

As used herein, the term "catalytic" or "catalytic 
activity" refers to the ability of a substance to affect a 
change in itself or of a substrate under permissive 

20 conditions. 

As used herein, the term "protein-protein complex" or 
"protein complex" refers to an association of more than one 
protein. The proteins of the complex may be associated by a 

13 



variety of means, or by any combination of means, including 
but not limited to functional, stereochemical, conformational, 
biochemical, or electrostatic association. It is intended 
that the term encompass associations of any number of 

proteins . 

As used herein the terms "protein", "polypeptide" or 
-peptide" refer to compounds comprising amino acids joined via 
peptide bonds and are used interchangeably. 

As used herein, the term "endogenous" refers to a 
substance the source of which is from within a cell. 
Endogenous substances are produced by the metabolic activity 
of a cell. Endogenous substances, however, may nevertheless 
be produced as a result of manipulation of cellular metabolism 
to, for example, make the cell express the gene encoding the 

isff substance. 
% as used herein, the term "exogenous" refers to a 

° substance the source of which is external to a cell. An 

exogenous substance may nevertheless be internalized by a cell 
by any one of a variety of metabolic or induced means known to 

those skilled in the art. 

As used herein, the term "gene" means the coding region 
of a deoxyribonucleotide sequence encoding the amino acid 

of a nrotein The term includes sequences located 
sequence of a protein. 

14 
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adjacent to the coding region on both the 5", and 3", ends 
such that the deoxyribonucleotide sequence corresponds to the 
length of the full-length mRNA for the protein. The term 
"gene" encompasses both cDNA and genomic forms of a gene. A 
genomic form or clone of a gene contains the coding region 
interrupted with non-coding sequences termed "introns" or 
"intervening regions" or "intervening sequences." Introns are 
segments of a gene which are transcribed into nuclear RNA 
(hnRNA) ; introns may contain regulatory elements such as 
enhancers. Introns are removed, excised or "spliced out" from 
the nuclear or primary transcript; introns therefore are 
absent in the messenger RNA (mRNA) transcript. The mRNA 
functions during translation to specify the sequence or order 
of amino acids in a nascent polypeptide. 

in addition to containing introns, genomic forms of a 
gene may also include sequences located on both the 5' and 3" 
end of the sequences which are present on the RNA transcript. 
These sequences are referred to as "flanking" sequences or 
regions (these flanking sequences are located 5' or 3' to the 
non-translated sequences present on the MRNA transcript) . The 
5- flanking region may contain regulatory sequences such as 
promoters and enhancers which control or influence the 
transcription of the gene. The 3' flanking region may contain 
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sequences which direct the termination of transcription, post- 
transcriptional cleavage and polyadenylation . 

DNA molecules are said to have "5' ends" and "3' ends" 
because mononucleotides are reacted to make oligonucleotides 
in a manner such that the 5' phosphate of one mononucleotide 
pentose ring is attached to the 3' oxygen of its neighbor in 
one direction via a phosphodiester linkage. Therefore, an end 
of an oligonucleotides referred to as the "5' end" if its 5' 
phosphate is not linked to the 3' oxygen of a mononucleotide 
pentose ring and as the "3' end" if its 3' oxygen is not 
linked to a 5' phosphate of a subsequent mononucleotide 
pentose ring. As used herein, a nucleic acid sequence, even 
if internal to a larger oligonucleotide, also may be said to 
have 5' and 3' ends. In either a linear or circular DNA 
molecule, discrete elements are referred to as being 
"upstream" or 5 ' of the "downstream" or 3' elements. This 
terminology reflects the fact that transcription proceeds in 
a 5' to 3' fashion along the DNA strand. 

The term "gene of interest" as used here refers to a 
gene, the function and/or expression of which is desired to be 
investigated, or the expression of which is desired to be 
regulated, by the present invention. In the present 
disclosure, the td gene of the T4 bacteriophage is an example 
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of a gene of interest and is described herein to illustrate 
the invention. The present invention may be useful in regard 
to any gene of any organism, whether of a prokaryotic or 
eukaryotic organism. 

The term "hybridize" as used herein, refers to any 
process by which a strand of nucleic acid binds with a 
complementary strand through base pairing. Hybridization and 
the strength of hybridization (i.e., the strength of the 
association between the nucleic acid strands) is impacted by 
such factors as the degree of complementary between the 
cleic acids, stringency of the conditions involved, the 
Iting temperature of the formed hybrid, and the G:C (or U:C 
for RNA) ratio within the nucleic acids. 

The terms "complementary" or "complementarity" as used 
herein, refer to the natural binding of polynucleotides under 
permissive salt and temperature conditions by base-pairing. 
For example, for the sequence "A-G-T" binds to the 
complementary sequence "T-C-A". Complementarity between two 
single-stranded molecules may be partial, in which only some 
of the nucleic acids bind, or it may be complete when total 
complementarity exists between the single stranded molecules. 
The degree of complementarity between nucleic acid strands has 
significant effects on the efficiency and strength of 
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hybridization between nucleic acid strands. This is of 
particular importance in amplification reactions, which depend 
upon binding between nucleic acids strands. 

The term "homology," as used herein, refers to a degree 
of complementarity. There may be partial homology or complete 
homology (i.e., identity). A partially complementary sequence 
is one that at least partially inhibits an identical sequence 
from hybridizing to a target nucleic acid; it is referred to 
using the functional term "substantially homologous." The 
inhibition of hybridization of the completely complementary 
sequence to the target sequence may be examined using a 
hybridization assay (Southern or Northern blot, solution 
hybridization and the like) under conditions of low 
stringency. A substantially homologous sequence or probe will 
| compete for and inhibit the binding (i.e., the hybridization) 

1 of a completely homologous sequence or probe to the target 

sequence under conditions of low stringency. This is not to 
say that conditions of low stringency are such that non- 
specific binding is permitted; low stringency conditions 
0 require that the binding of two sequences to one another be a 

specific (i.e., selective) interaction. The absence of non- 
specific binding may be tested by the use of a second target 
sequence which lacks even a partial degree of complementarity 
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(e.g., less than about 30% identity); in the absence of non- 
specific binding, the probe will not hybridize to the second 
non-complementary target sequence. When used in reference to 
a single-stranded nucleic acid sequence, the term 
"substantially homologous" refers to any probe which can 
hybridize (i.e., it is the complement of) the single-stranded 
nucleic acid sequence under conditions of low stringency as 
described. 

As known in the art, numerous equivalent conditions may 
be employed to comprise either low or high stringency 
conditions. Factors such as the length and nature (DNA, RNA, 
base composition) of the sequence, nature of the target (DNA, 
RNA, base composition, presence in solution or immobilization, 
etc.), and the concentration of the salts and other components 
(e.g., the presence or absence of formamide, dextran sulfate 
and/or polyethylene glycol) are considered and the 
hybridization solution may be varied to generate conditions of 
either low or high stringency different from, but equivalent 
to, the above listed conditions. 

As used herein the term "stringency" is used in reference 
to the conditions of temperature, ionic strength, and the 
presence of other compounds such as organic solvents, under 
which nucleic acid hybridizations are conducted. With "high 
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stringency" conditions, nucleic acid base pairing will occur 
only between nucleic acid fragments that have a high frequency 
of complementary base sequences. Thus, conditions of "weak" 
or "low" stringency are often required with nucleic acids that 
5 are derived from organisms that are genetically diverse, as 

the frequency of complementary sequences is usually less. 

Low stringency conditions comprise conditions equivalent 
to binding or hybridization at 42°C in a solution consisting 
of 5X SSPE (43.8 g/1 NaCl, 6.9 g/1 NaH 2 PO 4 'H 2 0 and 1.85 g/1 

1(5 EDTA, pH adjusted to 7 . 4 with NaOH) , 0.1% SDS, 5X Denhardt's 

y! 

reagent (50X Denhardt's cdntains per 500 ml : 5 g Ficoll (Type 
^ 400, Pharmacia), 5 g BSA [Fraction V; Sigma]) and 100 ug/ml 

yi 

W denatured salmon sperm DNA) followed by washing in a solution 

5 comprising 5X SSPE, 01% SDS at 42°C when a probe of about 500 

lC nucleotides in length is employed. 

Numerous equivalent conditions may be employed to 
comprise low stringency conditions; factors such as the length 
and nature (DNA, RNA, base composition) of the probe and 
nature of the target (DNA, RNA, base composition, present in 
solution or immobilized, etc.) and the concentration of the 
salts and other components (e.g., the presence or absence of 
formamide, dextran sulfate, polyethylene glycol) are 
considered and the hybridization solution may be varied to 
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generate conditions of low stringency hybridization different 
from, but equivalent to, the above listed conditions. In 
addition, the art knows conditions which promote hybridization 
under conditions of high stringency (e.g., increasing the 
temperature of the hybridization and/or wash steps, the use of 
formamide in the hybridization solution, etc.). 

The term "antisense," as used herein, refers to 
nucleotide sequences that are complementary to a specific DNA 
or RNA sequence. The term "antisense strand" is used in 
reference to a nucleic acid strand that is complementary to 
tile "sense" strand. Antisense molecules may be produced by 
any method, including synthesis by ligating the gene(s) of 
interest in a reverse orientation to a viral promoter which 
permits the synthesis of a complementary strand. Once 
introduced into a cell, the transcribed strand combines with 
natural sequences produced by the cell to form duplexes. 
These duplexes then block either the further transcription or 
translation. In this manner, mutant phenotypes may also be 
generated. The designation "negative" is sometimes used in 
reference to the antisense strand, and "positive" is sometimes 
used in reference to the sense strand. 

The term also is used in reference to RNA sequences that 
are complementary to a specific RNA sequence (e.g., mRNA). 



included within this definition are antisense RNA ( "asRNA" ) 
molecules involved in genetic regulation by bacteria. 
Antisense RNA may be produced by any method, including 
synthesis by splicing the gene(s) of interest in a reverse 
orientation to a viral promoter which permits the synthesis of 
a coding strand. Once introduced into an embryo, this 
transcribed strand combines with natural mRNA produced by the 
embryo to form duplexes. These duplexes then block either the 
further transcription of the mRNA or its translation. In this 
manner, mutant phenotypes may be generated. The term 
"antisense strand" is used in reference to a nucleic acid 
strand that is complementary to the "sense" strand. The 
designation. {-) (i.e., "negative") is sometimes used in 
reference to the antisense strand with the designation (+) 
sometimes used in reference to the sense (i.e., "positive") 
strand. 

A gene may produce multiple RNA species which are 
generated by differential splicing of the primary RNA 
transcript. cDNAs that are splice variants of the same gene 
will contain regions of sequence identity or complete homology 

(representing the presence of the same exon or portion of the 
same exon on both cDNAs) and regions of complete non-identity 

(for example, representing the presence of exon "A" on cDNA I 
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wherein cDNA 2 contains exon "B" instead) . Because the two 
cDNAs contain regions of sequence identity they will both 
hybridize to a probe derived from the entire gene or portions 
of the gene containing sequences found on both cDNAs; the two 
splice variants are therefore substantially homologous to such 
a probe and to each other. 

"Transformation," as defined herein, describes a process 
by which exogenous DNA enters and changes a recipient cell. 
It may occur under natural or artificial conditions using 
various methods well known in the art. Transformation may 
rely on any known method for the insertion of foreign nucleic 
acid sequences into a prokaryotic or eukaryotic host cell. 
The method is selected based on the host cell being 
transformed and may include, but is not limited to, viral 
infection, electroporation, lipofection, and particle 
bombardment. Such "transformed" cells include stably 
transformed cells in which the inserted DNA is capable of 
replication either as an autonomously replicating plasmid or 
as part of the host chromosome. The term "transf ection" as 
used herein refers to the introduction of foreign DNA into 

eukaryotic cells. 

Transfection may be accomplished by a variety of means 
known to the art including, e.g., calcium phosphate-DNA co- 
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precipitation, DEAE-dextran-mediated transf ection, polybrene- 
mediated transf ection, electroporation, microinjection, 
liposome fusion, lipofection, protoplast fusion, retroviral 
infection, and biolistics. Thus, the term "stable 

5 transfection" or "stably transfected" refers to the 

introduction and integration of foreign DNA into the genome of 
the transfected cell. The term "stable transf ectant" refers 
to a cell which has stably integrated foreign DNA into the 
_ genomic DNA. The term also encompasses cells which 

l(g transiently express the inserted DNA or RNA for limited 

m periods of time. Thus, the term "transient transfection" or 

fl "transiently transfected" refers to the introduction of 

foreign DNA into a cell where the foreign DNA fails to 
integrate into the genome of the transfected cell. The 
foreign DNA persists in the nucleus of the transfected cell 
2 for several days. During this time the foreign DNA is subject 

to the regulatory controls that govern the expression of 
endogenous genes in the chromosomes. The term "transient 
transfectant" refers to cells which have taken up foreign DNA 
20 but have failed to integrate this DNA. 

As used herein, the term "selectable marker" refers to 
the use of a gene that encodes an enzymatic activity and which 
confers the ability to grow in medium lacking what would 
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otherwise be an essential nutrient (e.g., the HIS3 gene in 
yeast cells); in addition, a selectable marker may confer 
resistance to an antibiotic or drug upon the cell in which the 
selectable marker is expressed. A review of the use of 
selectable markers in mammalian cell lines is provided in 
Sambrook, J. et. a.'., Molecular Cloning: A Laboratory Manual, 
2nd ed., Cold Spring Harbor Laboratory Press, New York (1989) 
pp. 16. 9-16. 15. 

As used herein, the term "reporter gene" refers to a gene 
that is expressed in a cell upon satisfaction of one or more 
S contingencies and which, upon expression, confers a detectable 

k phenotype to the cell to indicate that the contingencies for 

5 expression have been satisfied. For example, the gene for 

Luciferase confers a luminescent phenotype to a cell when the 
gene is expressed inside the cell. In the present invention, 
the gene for Luciferase may be used as a reporter gene such 
that the gene is only expressed upon the splicing out of an 
intron in response to an effector. Those cells in which the 
effector activates splicing of the intron will express 
Luciferase and will glow. 

As used herein, the term "vector" is used in reference to 
nucleic acid molecules that transfer DNA segment (s) from one 
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cell to another. The term "vehicle" is sometimes used 
interchangeably with "vector." 

The term "vector" as used herein also includes expression 
vectors in reference to a recombinant DNA molecule containing 
a desired coding sequence and appropriate nucleic acid 
sequences necessary for the expression of the operably linked 
coding sequence in a particular host organism. Nucleic acid 
sequences necessary for expression in prokaryotes usually 
include a promoter, an operator (optional), and a ribosome 
binding site, often along with other sequences. Eukaryotic 
cells are known to utilize promoters, enhancers, and 
J termination and polyadenylation signals. 

As used herein, the term "amplify", when used in 
reference to nucleic acids refers to the production of a 
large number of copies of a nucleic acid sequence by any 
method known in the art. Amplification is a special case of 
nucleic acid replication involving template specificity. 
Template specificity is frequently described in terms of 
"target" specificity. Target sequences are "targets" in the 
sense that they are sought to be sorted out from other nucleic 
acid. Amplification techniques have been designed primarily 
for this sorting out. 



m 
m 



O 



26 



As used herein, the term "primer" refers to an 
oligonucleotide, whether occurring naturally as in a purified 
restriction digest or produced synthetically, which is capable 
of acting as a point of initiation of synthesis when placed 
under conditions in which synthesis of a primer extension 
product which is complementary to a nucleic 'd strand is 
induced, (i.e., in the presence of nucleotides and an inducing 
agent such as DNA polymerase and at a suitable temperature and 
pH) . The primer may be single stranded for maximum efficiency 
in amplification but may alternatively be double stranded. If 
double stranded, the primer is first treated to separate its 
strands before being used to prepare extension products. The 
primer must be sufficiently long to prime the synthesis of 
extension products in the presence of the inducing agent. The 
exact lengths of the primers will depend on many factors, 
including temperature, source of primer and the use of the 
method. 

As used herein, the term "probe" refers to an 
oligonucleotide (i.e., a sequence of nucleotides), whether 
occurring naturally as in a purified restriction digest or 
produced synthetically, recombinant^ or by PCR amplification, 
which is capable of hybridizing to another oligonucleotide of 
interest. A probe may be single-stranded or double-stranded. 
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Probes are useful in the detection, identification and 
isolation of particular gene sequences. It is contemplated 
that any probe used in the present invention will be labeled 
with any "reporter molecule," so that is detectable in any 
5 detection system, including, but not limited to enzyme (e.g. 

ELISA, as well as enzyme-based histochemical assays) , 
fluorescent, radioactive, and luminescent systems. It is not 
intended that the present invention be limited to any 
particular detection system or label. 
1§ As used herein, the term "target" when used in reference 

JJ to the polymerase chain reaction, refers to the region of 

K nucleic acid bounded by the primers used for polymerase chain 

S reaction. Thus, the "target" is sought to be sorted oat from 

□ other nucleic acid sequences. A "segment" is defined as a 

1|2 region of nucleic acid within the target sequence, 

g As used herein, the term "polymerase chain reaction" 

("PCR") refers to the method of K.B. Mullis U.S. Patent Nos . 
4,683,195, 4,683,202, and 4,965,188, hereby incorporated by 
reference, which describe a method for increasing the 
concentration of a segment of a target sequence in a mixture 
of genomic DNA without cloning or purification. This process 
for amplifying the target sequence consists of introducing a 
large excess of two oligonucleotide primers to the DNA mixture 
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containing the desired target sequence, followed by a precise 
sequence of thermal cycling in the presence of a DNA 
polymerase. The two primers are complementary to their 
respective strands of the double stranded target sequence. 
5 To effect amplification, the mixture is denatured and the 

primers then annealed to their complementary sequences within 
the target molecule. Following annealing, the primers are 
extended with a polymerase so as to form a new pair of 
complementary strands. The steps of denaturation, primer 
l(g annealing and polymerase extension can be repeated many times 

J (i.e., denaturation, annealing and extension constitute one 

£ "cycle"; there can be numerous "cycles") to obtain a high 

® concentration of an amplified segment of the desired target 

W 

1 sequence. The length of the amplified segment of the desired 

if} target sequence is determined by the relative positions of the 

£ primers with respect to each other, and therefore, this length 

0 is a controllable parameter. By virtue of the repeating 

aspect of the process, the method is referred to as the 
"polymerase chain reaction" (hereinafter "PCR") . Because the 
desired amplified segments of the target sequence become the 
predominant sequences (in terms of concentration) in the 
mixture, they are said to be "PCR amplified". 
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With PCR, it is possible to amplify a single copy of a 
specific target sequence in genomic DNA to a level detectable 
by several different methodologies (e.g., hybridization with 
a labeled probe; incorporation of biotinylated primers 
followed by avidin-enzyme conjugate detection; incorporation 
of 32p-i a beled deoxynucleotide triphosphates, such as DCTP or 
DATP, into the amplified segment) . In addition to genomic 
DNA, any oligonucleotide sequence can be amplified with the 
appropriate set of primer molecules. In particular the 
amplified segments created by the PGR process itself are, 
themselves, efficient templates for subsequent PGR 

amplifications . 

The invention is now described in detail with the use of 
the td intron from T4 bacteriophage (Fig. D for illustrative 
purposes. The description is not intended to limit the scope 
of the invention or the claims appended hereto. 
S Figure 1 depxVs the secondary structure of the td intron 

from bacteriophage X^GenBank # M12742) , wherein -td" means 
theophylline-dependentX The td intron was selected to 
illustrate the present inve\ion because, among other things, 
Uational analysis has identiHed regions of this intron that 
can be engineered and modified. \e Salvo, et al., Deletion- 
tolerance and trans-splicing of theVcteriophage T4 intron. 
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analysis of theV-1.6a region. „. Mo,. Bio,. 211. 53,-549 
(1990 , and Salvo et^l., The P2 element of the td intron is 
dispensable despite \normal role in splicing. J. 
Bio,. 267, 28,5-2848 (1 9 V Thus, aptamer domains or pools 
may be engineered into the \ intron . 

A n anti-theophylline aptamer has been described by R.D. 

»t al High-resolution molecular discrimination by 

Jenison, et al - r mgu 

RBA . science 263, 1125-1429 ,1994, . In the present 
invention, the anti-theophylline aptamer was mounted in two 
iocations in the td intron, shown by the shaded portions of 

4- hp termini of Pi and the other 
Fig. 1. One location was at the termini o 

. th . n p6 Th e PI constructs may enable ligand- 
location was within Pb. me 

dependent conformational changes that alter the conformation 
or agister of the u:G base pair which is critical for 
splicing. The PS region was selected because mutational 
analysis indicated that deletion of the P6 stem destabilizes 

the intron. 

Referring now to Figs 2a and 2b, in the present invention 
P6 constructs were made so that Group X splicing was activated 
by the presence of theophylline in the range of approximately 
9 t o 19 fold over constructs grown in the absence of 
theophylline, as described in the examples below: 
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The following examples illustrate the present invention 
in the td gene system of T4. For a full understanding of the 
examples, refer to Figures 2a and 2b. The examples are 
provide for illustrative purposes and do not limit the scope 
of the present invention or the scope of the appended claims. 



Example 1: GpIThlP6 
Engineering of Group I Aptazymes 

The first example illustrates how to make an aptazyme 
construct and demonstrates self-splicing of the aptazyme out 
of a gene in response to an effector molecule. 

mnai-riint-.iQP nf a re gnl stable aptazyme 

Oligos GpIWt3.129: 5 1 -TAA TCT TAC CCC GGA ATT ATA TCC AGC 
TGC ATG TCA CCA TGC AGA GCA GAC TAT ATC TCC AAC TTG TTA AAG 
CAA GTT GTC TAT CGT TTC GAG TCA CTT GAC CCT ACT CCC CAA AGG 
GAT AGT CGT TAG- 3 ' (SEQ ID NO: 1) and GpIThlP6 . 131 : 5 • -GCC TGA 
GTA TAA GGT GAC TTA TAC TTG TAA TCT ATC TAA ACG GGG AAC CTC 
TCT AGT AGA CAA TCC CGT GCT AAA TTA TAC CAG CAT CGT CTT GAT 
GCC CTT GGC AGA TAA ATG CCT AAC GAC TAT CCC TT-3' (SEQ ID NO: 
2) were annealed and extended in a 30 »1 reaction containing 
100 pmoles of each oligo, 250 mM Tris-HCl (pH 8.3), 40 mM 
MgCl 2 , 250 mM NaCI, 5 mM DTT, 0.2 mM each dNTP, 45 units of 
AMV reverse transcriptase (RT: Amersham Pharmacia Biotech, 

32 



Inc., Piscataway, NJ) at 37° C for 30 minutes. The extension 
reaction was diluted 1 to 50 in H 2 0. 

A PCR reaction containing 1 yul of the extension dilution, 
500 mM KC1, 100 mM Tris-HCl, (pH 9.0), 1% Triton® x-100, 15 mM 
5 MgCl 2 , 0.4 /M of GpIWtl.75: 5 ' -GAT AAT ACG ACT CAC TAT AGG GAT 

CAA CGC TCA GTA GAT GTT TTC TTG GGT TAA TTG AGG CCT GAG TAT 
AAG GTG-3 ' (SEQ ID NO: 3), 0.4 yuM of GpIWt4.89: 5 ' -CTT AGC TAC 
AAT ATG AAC TAA CGT AGC ATA TGA CGC AAT ATT AAA CGG TAG CAT 
TAT GTT CAG ATA AGG TCG TTA ATC TTA CCC CGG AA-3 1 (SEQ ID 
1(2 NO:4), 0.2 mM each dNTP and 1.5 units of Taq polymerase 

m (Promega, Madison, WI) was thermocycled 20 times under the 

sssss 

CH following regime: 94° C for 30 seconds, 45° C for 30 seconds, 

in 

fB 72° C for 1 minute. The PCR reaction was precipitated m the 

S 

O presence of 0.2 M NaCl and 2.5 volumes of ethanol and then 

if quantitated by comparison with a molecular weight standard 

O using agarose gel electrophoresis. 

The aptazyme construct was transcribed in a 10 /zl high 
yield transcription reaction (AmpliScribe from Epicentre, 
Madison, WI . The reaction contained 500 ng PCR product, 3.3 
20 pmoles of P 32 [a-32 P]UTP (3000 Camel), IX AmpliScribe 

transcription buffer, 10 mM DTT, 7 . 5 mM each NTP, and 1 /zl 
AmpliScribe T7 polymerase mix. The transcription reaction was 
incubated at 37° C for 2 hours. One unit of RNase free-DNase 
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was added and the reaction returned to 37° C for 30 minutes. 
The transcription was then purified on a 6% denaturing 
polyacrylamide gel to separate the full length RNA from 
incomplete transcripts and spliced products, eluted and 
quantitated spectrophotometrically . 
Tn vitro Assay 

The RNA (4 pmoles/12 //l H 2 0) was heated to 94° C for 1 
minute then cooled to 37° C over 2 minutes in a thermocycler . 
The RNA was divided into 2 splicing reactions (9 £il each) 
containing 100 mM Tris-HCl (pH 7.45), 500 mM KC1 and 15 mM 
MgCl 2 , plus or minus theophylline (2 mM) . The reactions were 
immediately placed on ice for 30 minutes. GTP (1 mM) was 
added to the reactions (final volume of 10 fxl) and the 
reactions were incubated at 37° C for 2 hours. 

The reactions were terminated by the addition of stop dye 
(10 /ul) (95% formamide, 20 mM EDTA, 0.5% xylene cyanol, and 
0.5% bromophenol blue). The reactions were heated to 70° C 
for 3 minutes and 10 /xl was electrophoresed on a 6% denaturing 
polyacrylamide gel. The gel was dried, exposed to a phosphor 
screen and analyzed using a Molecular Dynamics Phosphorimager 

(Sunnyvale, CA) . 

Activation was determined from the amount of circular 
intron in each reaction. Circularized introns migrate slower 
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than linear RNA and can be seen as the bands above the dark 
bands of linear RNA in the +Theo lanes of" the gels of Figs. 2a 
and. 2b. 

Example 2: GpIP6Thpool 
In vitro Selection of Group I Aptazymes 

Example 2 illustrates how to generate a population of 
aptazymes so that there is variation in the nucleotide 
sequence of the aptamers . This example also illustrates how 
to select for phenotypes that are responsive to an effector 
molecule from among that population of aptazymes. 

rnnstruct inn of Pool 

The construction of the pool was similar to the 
construction of the individual engineered aptazyme constructs. 
Oligos G P IWt3.129 and GpIThP6pool: 5 • -GCC TGA GTA TAA GGT GAC 
TTA TAC TAG TAA TCT ATC TAA ACG GGG AAC CTC TCT AGT AGA CAA 
TCC CGT GCT AAA TN(1-4)A TAC CAG CAT CGT CTT GAT GCC CTT GGC 
AGN (1-4) TAA ATG CCT AAC GAC TAT CCC TT-3' (SEQ ID N0:5) were 
extended in the same manner as above. The extension reaction 
was diluted and used as template for a PCR reaction. The PCR 
reaction was similar to the reaction described with the 
following exceptions: the volume was doubled and GpIWt4.8 9 was 
replaced with GplMutG.101: 5 ' -CTT AGC TAC AAT ATG AAC TAA CGT 
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AGC ATA TGA CGC AAT ATT AAA CGG TAG TAT TAT GTT CAG ATA AGG 
TCG TTA ATC TTA CCC CGG AAT TCT ATC CAG CT-3' (SEQ ID NO:6) in 
which there is an G to A mutation at the terminal residue of 
the intron. The pool had a diversity of 1.16X10' molecules. 
RNA was made as described above. 
rn vitro N oqaHvfi Selection 

The RNA (10 pmoles/70 y.1 H 2 0) was heated to 94° C for 1 
minute then cooled to 37° C over 2 minutes in a thermocycler . 
The splicing reaction (90 „!> contained 100 mM Tris-HCl ( P H 
7.45), 500 mM KC1 and 15 mM MgCl 2 . The reaction was 
immediately placed on ice for 30 minutes. GTP (1 mM) was 
added to the reaction (final volume of 100 „1) and the 
reaction was incubated at 37° C for 20 hours. The reaction 
was terminated by the addition 20 mM EDTA and precipitated in 
the presence of 0.2 M NaCl and 2 . 5 volumes of ethanol . The 
reaction was resuspended in 10 »1 H 2 0 and 10 „1 stop dye and 
heated to 70° C for 3 minutes and was electrophoresed on a 6% 
denaturing polyacrylamide gel with Century™ Marker ladder 
(Ambion, Austin, TX) . The gel was exposed to a phosphor 
screen and analyzed. The unreacted RNA was isolated from the 
gel, precipitated and resuspended in 10 /zl of H 2 0. 
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Pr.g-1 l-i vp. Selection 

The RNA (5 fil of negative selection) was heated to 94° C 
for 1 minute then cooled to 37° C over 2 minutes in a 
thermocycler. The positive splicing reaction (45 /zl) 
contained 100 mM Tris-HCl (pH 7.45), 500 mM KC1, 15 mM MgCl 2 
and lmM theophylline. The' reaction was immediately placed on 
ice for 30 minutes. GTP (1 mM) was added to the reaction 
(final volume of 50 n±) and the reaction was incubated at 37° 
C for 1 hour. The reaction was terminated by the addition of 
stop dye, heated to 70° C for 3 minutes and was 
electrophoresed on a 6% denaturing polyacrylamide gel with 
% Century™ Marker ladder. The gel was exposed to a phosphor 

1 screen and analyzed. The band corresponding to the linear 

intron was isolated from the gel and precipitated and 
1§& resuspended in 20 fJ-1 H 2 0. 

J Am p! i f ic? Hon and Trayisr.ri ption 

0 The R NA was reverse transcribed in a reaction containing 

250 mM Tris-HCl (pH 8.3), 375 mM KC1, 15 mM, MgCl 2 , 0.1 M DTT, 
0.4 mM of each dNTP 2 fM GpIMutG.101 and 400 units of 
Superscript II reverse transcriptase (Gibco BRL, Rockville, 
MD) . The cDNA was then PCR amplified, transcribed and gel 
purified as described above. 
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Figure 3a depicts the critical residues of the P6 region 
f the Group I ribozyme joined to the anti-theophylline 
aptamer by a short randomized region to generate a pool of 
aptazymes of the present invention. The residues shown in 
bold in Fig. 4a are the P6 critical residues, and the faded 
residues shown in Fig. 4a are the anti-theophylline aptamer. 
The randomized regions are designated in Fig. 4a as "Nl-4". 
Approximately 40 random sequence residues are introduced into 
the Nl-4 region of the construct to synthesize a pool of 
aptazymes, referred to herein as a communication module pool. 
Figure 3b shows a selection protocol for the Group I P6 
| Aptazyme Pool of Fig. 3a. Positive and negative selections 

are made in vitro to select Group I aptazymes that are 
dependent on theophylline. The selections are described above 
in Example 2 for a specific embodiment of the present 
invention - 

A communication module pool can be transformed with the 
selected aptazymes. The best theophylline-dependent Group I 
aptazymes that have been derived by any of the methods 
described herein may undergo further selection by partially 
randomizing their sequences and selecting for improved 
performance . 
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Strategies similar to those depicted in Figs 3a and 3b 
may be used to develop aptazymes and aptamers dependent on any 
desired effector molecule. See generally G.A. Soukup, et al . , 
Engineering precision RNA molecular switches. Proc. Natl. 
Acad. Sci. U.S.A. 96, 3584-3589 (1999) and M. Koizumi, et al . , 
Allosteric selection of ribozymes that respond to the second 
messengers cGMP and cAMP. Nature Struct. Biol. 6, 1062-1071 
(1999). Positive and negative in vitro selection such as 
depicted in Fig. 3b are described above in Example 2 for a 
specific embodiment of the present invention. The 
optimization strategies described herein yield Group I 
aptazymes that are highly dependent on small molecule 
effectors . 

Figure 4 is a diagrammatic representation of one 
embodiment of the exogenous or endogenous activation of Group 
I intron splicing is depicted. A gene of interest 10 is fused 
to a reporter gene 12 such as luciferase or beta- 
galactosidase, which also contains the group I intron (td) 14. 
Splicing-out of the Group I intron is induced by an endogenous 
effector molecule 16. Alternatively, splicing-out of the 
Group I intron may be induced by an exogenous effector 
molecule 18. Activation of the aptazyme and auto-excision of 
the intron results in expression of the reporter gene encoded 
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protein 20 that is detect by, e.g., fluorescence 22 or any 
other desired detectable reaction. 

Figure 5 is a diagram of another embodiment of the 
present invention. Libraries of candidate exogenous 

5 activators 30 can be generated from a randomized aptazyme pool 

indicated by random loop E^. As in the embodiment of Fig. 4, 
a reporter gene 12 is expressed in cells where the exogenous 
activator 30 activates the aptazyme to release the intron, 
which may contain a random loop 32, from the gene. In this 
lP embodiment, the reaction occurs within cells which are then 

sorted 34 based on a chromogenic reaction or emission 22, or 
may even be isolated by, e.g, statistical cell separation 

pi 

m cloning. As will be known to those of skill in the art of 

enzymatic oligonucleotides any number of current and future 
effector molecule libraries may be used with the present 
invention. 

Figure 6 depicts an alternative embodiment for screening 
libraries of exogenous activators. Group I introns with 
length polymorphisms are induced into the construct by trans- 
splicing with an independent oligonucleotide. Libraries of 
candidate exogenous activators 30 can be generated from a 
randomized aptazyme pool indicated by random loop E^. As in 
the embodiment of Fig. 5, a reporter gene 12 is expressed in 
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cells where the exogenous activator 30 activates the aptazyme 
to release the intron, which may contain a random loop 32, 
from the gene. In this embodiment, the reaction occurs within 
the intron 14 and an independent oligonucleotide 36 by a 
trans-splicing reaction and extraction step 38. Extracted 
trans-spliced intron reporter gene constructs are then 
amplified by, e.g., polymerase chain reaction in step 40, 
followed by transformation of cells with the transpliced 
construct at step 42. Transformation of the transpliced 
construct may be performed by those of skill in the art with 
either a negative or positive selection scheme for 
identification of the trans-spliced gene. 

All publications mentioned in the above specification are 
hereby incorporated by reference. Modifications and 

variations of the described compositions and methods of the 
invention will be apparent to those skilled in the art without 
departing from the scope and spirit of the invention. 
Although the invention has been described in connection with 
specific embodiments, it should be understood that the 
invention as claimed should not be unduly limited to such 
specific embodiments. Indeed, various modifications of the 
described compositions and modes of carrying out the invention 
which are obvious to those skilled in molecular biology or 
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related arts are intended to be within the scope of the 
following claims. 
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